Automated film-making using image-based object tracking

ABSTRACT

A method for image capture. The method includes generating, by disposing a light sensing device (112) at one or more locations in a scene (140), a direction of a visible light source (118) from each of the one or more locations in the scene (140), generating, based at least on the direction of the visible light source (118) and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location (145b) and a target object location (144b) in the scene (140), and transmitting a command to a camera device (110) to capture an image of an object in the scene (140) based on the physical configuration of the image capture.

BACKGROUND

A movie script is a document, typically a work product from a screenwriter, that specifies characters' movements, actions, expressions, and dialogues, as well as sound effects and other settings of a movie film. The movie script may be based on various different formats adopted by the movie industry. A field-of-view (FOV) is an extent of a scene that is imaged by a camera. An object inside the FOV will appear in an image captured and/or outputted by the camera. For example, the FOV may correspond to a solid angle within which a camera lens projects light input to an optical sensor of the camera.

SUMMARY

In general, in one aspect, the invention relates to a method for image capture. The method includes generating, by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene, generating, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene, and transmitting a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.

In general, in one aspect, the invention relates to an image capture controller. The image capture controller includes a computer processor and memory storing instructions that, when executed, cause the computer processor to generate, by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene, generate, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene, and transmit a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.

In general, in one aspect, the invention relates to a system for image capture. The system includes a light sensing device, a camera device, and an image capture controller configured to generate, by disposing the light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene, generate, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene, and transmit a command to the camera device to capture an image of an object in the scene based on the physical configuration of the image capture.

In general, in one aspect, the invention relates to a non-transitory computer readable medium storing instructions for image capture. The instructions, when executed by a computer processor, comprise functionality for generating, by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene, generating, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene, and transmitting a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.

Other aspects of the invention will be apparent from the following description and the appended claims.

BRIEF DESCRIPTION OF DRAWINGS

FIGS. 1.1, 1.2, and 1.3 show schematic block diagrams of a system in accordance with one or more embodiments of the invention.

FIGS. 2.1 and 2.2 show method flowcharts in accordance with one or more embodiments of the invention.

FIGS. 3.1, 3.2, 4, 5, and 6 show various examples in accordance with one or more embodiments of the invention.

FIGS. 7.1 and 7.2 show a computing system in accordance with one or more embodiments of the invention.

DETAILED DESCRIPTION

Specific embodiments of the invention will now be described in detail with reference to the accompanying figures. Like elements in the various figures may be denoted by like reference numerals for consistency.

In the following detailed description of embodiments of the invention, numerous specific details are set forth in order to provide a more thorough understanding of the invention. However, it will be apparent to one of ordinary skill in the art that the invention may be practiced without these specific details. In other instances, well-known features have not been described in detail to avoid unnecessarily complicating the description.

In the following description, any component described with regard to a figure, in various embodiments of the invention, may be equivalent to one or more like-named components described with regard to any other figure. For brevity, at least a portion of these components are implicitly identified based on various legends. Further, descriptions of these components will not be repeated with regard to each figure. Thus, each and every embodiment of the components of each figure is incorporated by reference and assumed to be optionally present within every other figure having one or more like-named components. Additionally, in accordance with various embodiments of the invention, any description of the components of a figure is to be interpreted as an optional embodiment which may be implemented in addition to, in conjunction with, or in place of the embodiments described with regard to a corresponding like-named component in any other figure. In the figures, black solid collinear dots indicate that additional components similar to the components before and/or after the solid collinear dots may optionally exist. Further, a solid line or a dashed line connecting the components of a figure represents a relationship between the connected components. The dashed line indicates that the relationship may not include or otherwise associate with any physical connection or physical element.

Throughout the application, ordinal numbers (e.g., first, second, third, etc.) may be used as an adjective for an element (i.e., any noun in the application). The use of ordinal numbers is not to imply or create any particular ordering of the elements nor to limit any element to being only a single element unless expressly disclosed, such as by the use of the terms “before”, “after”, “single”, and other such terminology. Rather, the use of ordinal numbers is to distinguish between the elements. By way of an example, a first element is distinct from a second element, and the first element may encompass more than one element and succeed (or precede) the second element in an ordering of elements.

In one or more embodiments of the invention, a direction of a visible light source from one or more locations in a scene is generated using a light sensing device. A movie script is analyzed to determine pre-determined image capture criteria for image frames of a movie. The image capture criterion of each image frame includes lighting conditions such as front lighting, side lighting, and back lighting, as well as image capture types such as close-up, half-portrait, full-portrait, and wide-angle. While the description herein only illustrates the above lighting conditions and image capture types as examples, other lighting conditions and image capture types may be selected. Based at least on the direction of the visible light source and the image capture criterion, a target camera location and/or a target object location in the scene is generated for each image frame. Accordingly, an image of an object in the scene is captured by a camera device based on the target camera location and/or the target object location. In one or more embodiments, an automated image capture controller uses an image-based object tracking technique to generate the target camera location and/or the target object location, as well as to direct the camera device to perform the image capture.

FIG. 1.1 shows a system (100) in accordance with one or more embodiments. In one or more embodiments, one or more of the modules and elements shown in FIG. 1.1 may be omitted, repeated, and/or substituted. Accordingly, embodiments of the invention should not be considered limited to the specific arrangements of modules shown in FIG. 1.1.

As shown in FIG. 1.1, the system (100) includes a camera device (110) having a camera lens (111 a) with a field-of-view (FOV) (141), an automated image capture controller (120), a scene (140), object(s) (e.g., object A (142 a), object B (142 b)) within the scene (140), a visible light source (118), and a light sensing device (112). One or more of the object(s) may appear within the FOV (141), such as the entirety of the object A (142 a) or a portion of the object B (142 b). A light source is a source of light, which may be visible light or infrared light. The term “light source” may also refer to a corresponding spot produced by the light source in a captured image. The visible light source (118) emits visible light to illuminate the object(s) (e.g., object A (142 a), object B (142 b)) within the scene (140) such that object images (e.g., photographs, video recordings, etc.) may be captured by the camera device (110). In one or more embodiments, one or more images are captured as still images or photographs. In one or more embodiments, one or more images are captured as frames of a video recording, for example referred to as image frames in a movie. The lighting direction (e.g., lighting direction A (119 a), lighting direction B (119 b)) of a location in the scene (140) is the direction along which visible light emitted from the visible light source (118) illuminates the particular location. The light sensing device (112) is a device that senses the direction of light emitted from the light source, such as from the visible light source (118). For example, the light sensing device (112) may include a complementary-metal-oxide-semiconductor (CMOS) or charge-coupled-device (CCD) sensing element mounted on a tilt-and-swivel platform. In one or more embodiments, the tilt-and-swivel platform orients the CMOS or CCD sensing element to detect a direction (i.e., lighting direction A (119 a)) of incoming light rays. In one or more embodiments, the visible light source (118) is sufficiently far away from the scene (140) such that the lighting directions everywhere in the scene (140) are parallel to each other. For example, the visible light source (118) may be the natural sun light emitted from the sun such that the lighting direction A (119 a) and lighting direction B (119 b) are substantially the same direction within the scene (140). In one or more embodiments, the visible light source (118) is near or within the scene (140) such that the lighting directions in the scene (140) are location dependent. In such embodiments, the lighting direction A (119 a) and lighting direction B (119 b) are substantially different directions within the scene (140).

In one or more embodiments of the invention, the camera device (110) is a device with one or more camera lenses (e.g., camera lens (111 a)) and associated components (e.g., optical sensor (not shown)) for taking photographs and/or video recordings. A dedicated camera with communication capability is an example of the camera device (110). In one or more embodiments, the camera device (110) is a mobile device, such as a mobile phone with a built-in camera, referred to as a smart phone. A smart phone may have a display with a graphical user interface that occupies a large portion (e.g., 70% or larger) of the front surface. The camera lens (111 a) may be on the front surface or back surface of the smart phone. In one or more embodiments, the light sensing device (112) is integrated in the camera device (110) and is the same CMOS or CCD sensing element used for taking photographs and/or video recordings.

In one or more embodiments, the scene (140) is a place where an action or event, imaged by the camera device (110), occurs. In particular, the action or event may be associated with the object(s) (e.g., object A (142 a), object B (142 b)). Further, one or more objects may be stationary, moving from time to time, or constantly moving within the scene (140). The field-of-view (FOV) (141) is an extent of the scene (140) that is imaged by the camera device (110) using the camera lens (111 a). In other words, an object (e.g., object A (142 a)) inside the FOV (141) will appear in an image captured and/or outputted by the camera device (110). For example, the FOV (141) may correspond to a solid angle within which the camera lens (111 a) projects light input to the associated optical sensor (not shown) of the camera device (110). In one or more embodiments, the center (141 a) of the FOV (141) is aligned with an optical axis (111) of the camera lens (111 a). The optical axis (111) is an imaginary line that passes through optical element(s) of the camera lens (111 a). In one or more embodiments, the FOV (141) corresponds to different portions of the scene (140) according to how the camera lens (111 a) is oriented toward, zoomed with respect to, or otherwise positioned relative to, the scene (140). In one or more embodiments, the object A (142 a) may move across the scene (140) during the action or event. Object tracking is the action causing the camera lens (111 a) to be oriented toward, zoomed with respect to, or otherwise positioned relative to the scene (140) such that the object A (142 a) is continuously within the FOV (141), or at a target position within the FOV (141), during image captures. Generally, the location of an object (e.g., object A (142 a)) in the scene corresponds to a position of the object appearing in the FOV (141) or appearing in the captured image. Throughout this disclosure, the terms “object tracking” and “tracking” may be used interchangeably. In one or more embodiments, when the object A (142 a) is not at the center (141 a) of the FOV (141), the object A (142 a) appears off-centered in the image captured by the camera device (110). The position offset (from the center of the image) of the object A (142 a) appearing in the image is proportional to the angle between the optical axis (111) and the direction (146) from the camera lens (111 a) to the object A (142 a) in the FOV (141).

As illuminated by the visible light source (118), the image of an object (e.g., object A (142 a)) may be captured under different lighting conditions, such as front lighting, side lighting, and back lighting. The lighting angle is the angle formed between the lighting direction of the object and the direction from the camera lens (111 a) to the object. For example, with the object A (142 a) at the object location (144 a) and the camera device (110) at the camera location (145 a), the lighting angle (135) is an obtuse angle denoted as α. Generally, front lighting is illuminating the object by having the camera device (110) and the visible light source (118) located at the same side of the object. In the front lighting condition, the lighting angle is smaller than a pre-determined front lighting threshold, such as 60 degrees. In contrast, back lighting is illuminating the object by having the camera device (110) and the visible light source (118) located at the opposite sides of the object. In the back lighting condition, the lighting angle exceeds a pre-determined back lighting threshold, such as 140 degrees. Side lighting is illuminating the object by having the camera device (110) and the visible light source (118) located at orthogonal sides of the object. In the side lighting condition, the lighting angle is between the pre-determined front lighting threshold and the pre-determined back lighting threshold, such as between 60 degrees and 140 degrees. With the object A (142 a) at the object location (144 a) and the camera device (110) at the camera location (145 a), the lighting angle (135) is an angle between 60 degrees and 140 degrees such that the image of the object A (142 a) may be captured by the camera device (110) under the side lighting condition. With the object A (142 a) at the target object location (144 b) and the camera device (110) at the camera location (145 a), the lighting angle would exceed 140 degrees such that the image of the object A (142 a) may be captured by the camera device (110) under the back lighting condition. With the object A (142 a) at the object location (144 a) and the camera device (110) at the target camera location (145 b), the lighting angle would be less than 60 degrees such that the image of the object A (142 a) may be captured by the camera device (110) under the front lighting condition. With the visible light source (118) at a fixed location with respect to the scene (140), the image of the object A (142 a) may be captured by the camera device (110) under different lighting conditions by moving the object A (142 a) and/or the camera device (110) relative to each other and relative to the visible light source (118). Although specific values (i.e., 60 degrees and 140 degrees) of the pre-determined front lighting threshold and the pre-determined back lighting threshold are used in the example above, other values of the pre-determined front lighting threshold and the pre-determined back lighting threshold may also be used. Based on the lighting direction, the pre-determined front lighting threshold, and the pre-determined back lighting threshold, the target camera location may be determined as an angular sector referenced by the current object location to achieve a specified lighting condition. Similarly, the target object location may be determined as an angular sector referenced by the current camera location to achieve a specified lighting condition.
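
As a minimal illustration of the classification described above (not part of the original disclosure), the lighting angle and lighting condition might be computed as follows in Python; the threshold values mirror the 60-degree and 140-degree examples, and all names are illustrative:

    import math

    # Illustrative thresholds; the text notes other values may be used.
    FRONT_LIGHTING_MAX_DEG = 60.0
    BACK_LIGHTING_MIN_DEG = 140.0

    def lighting_angle_deg(light_dir, camera_to_object_dir):
        """Angle between the lighting direction at the object and the
        direction from the camera lens to the object, in degrees."""
        dot = sum(a * b for a, b in zip(light_dir, camera_to_object_dir))
        norm = math.hypot(*light_dir) * math.hypot(*camera_to_object_dir)
        return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

    def classify_lighting(angle_deg):
        """Map a lighting angle to front, side, or back lighting."""
        if angle_deg < FRONT_LIGHTING_MAX_DEG:
            return "front lighting"
        if angle_deg > BACK_LIGHTING_MIN_DEG:
            return "back lighting"
        return "side lighting"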

Subject ratio is the fraction (e.g., percentage) of the FOV (141) occupied by a portion of the object appearing within the FOV (141). Accordingly, the subject ratio determines the fraction (e.g., percentage) of the image size occupied by the visible portion of the object appearing in the image. In one or more embodiments, the subject ratio is dependent on the ratio of the focal length of the camera lens (111 a) over the distance between the object and the camera lens (111 a). For example, as the distance between the object and the camera lens (111 a) increases, the subject ratio decreases. Close-up is a type of image capture that tightly frames an object, in particular a person. In the close-up type of image capture, the subject ratio exceeds a pre-determined close-up size threshold, such as 70%. Half-portrait is a type of image capture that tightly frames the upper body of a person. In the half-portrait type of image capture, only the upper body of the subject appears in the half-portrait image with the subject ratio exceeding a pre-determined half-portrait size threshold, such as 60%. Full-portrait is a type of image capture that tightly frames the entire body of a person. In the full-portrait type of image capture, the entire body of the subject appears in the full-portrait image with the subject ratio exceeding a pre-determined full-portrait size threshold, such as 50%. Wide-angle is a type of image capture where the FOV as a solid angle exceeds a pre-determined wide-angle threshold, such as 120 degrees. The subject ratio of an object appearing in the wide-angle image is less than a pre-determined wide-angle size threshold, such as 15%. Generally, the image of the object A (142 a) may be captured by the camera device (110) with a close-up, half-portrait, full-portrait, or wide-angle condition by moving the object A (142 a) and/or the camera device (110) relative to each other. Based on the focal length and zoom factor of the camera lens (111 a), the target camera location may be determined as a radial distance range referenced by the current object location to achieve a specified image capture type and subject ratio. Similarly, the target object location may be determined as a radial distance range referenced by the current camera location to achieve a specified image capture type and subject ratio. In one or more embodiments, the subject ratio may be determined using an image algorithm that analyzes the pixels occupied by the object in the image based on a model of the object (e.g., a human, a cat, or a dog). The target camera location and/or target object location may be determined based on the desired subject ratio. In one or more embodiments, the image captured by the camera device may be cropped to achieve a specified image capture type and subject ratio of the object, without changing the target camera location and/or target object location. In one or more embodiments, the image captured by the camera device may be cropped to achieve a specified image capture type and subject ratio of the object, along with changing the target camera location and/or target object location.
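
Since the subject ratio is stated to scale with focal length over distance, one hedged way to derive the radial distance for a desired subject ratio is to scale the current distance by the ratio of the two subject ratios. A rough sketch under that assumption (the thresholds mirror the percentage examples above; names are illustrative):

    # Illustrative size thresholds from the examples above.
    CAPTURE_TYPE_MIN_RATIO = {
        "close-up": 0.70,
        "half-portrait": 0.60,
        "full-portrait": 0.50,
    }
    WIDE_ANGLE_MAX_RATIO = 0.15

    def classify_capture_type(subject_ratio):
        """Map a measured subject ratio to the tightest qualifying type."""
        for capture_type, min_ratio in CAPTURE_TYPE_MIN_RATIO.items():
            if subject_ratio >= min_ratio:
                return capture_type
        return "wide-angle" if subject_ratio < WIDE_ANGLE_MAX_RATIO else None

    def target_distance(current_distance, current_ratio, desired_ratio):
        """For a fixed focal length and zoom factor, the subject ratio is
        taken to be inversely proportional to the object-to-lens distance."""
        return current_distance * current_ratio / desired_ratio

For example, an object filling 30% of the frame at 4 meters would fill roughly 70% (close-up) at target_distance(4.0, 0.30, 0.70), about 1.7 meters.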

In one or more embodiments, the movie script may include a machine readable format that specifies, among other information, the lighting condition and subject ratio for one or more image frames of the movie film. For example, the object(s) may be stationary or moving in the image frames. According to the lighting condition and subject ratio specified in each of the image frames, the objects (e.g., object A (142 a), object B (142 b)) and/or the camera device (110) may be positioned within the scene (140) to generate the image frames.

In one or more embodiments, the automated image capture controller (120) includes a hardware component, a software component, or a combination thereof. In one or more embodiments, the automated image capture controller (120) is configured to generate, based at least on the direction of the visible light source (118) and a pre-determined image capture criterion, a physical configuration of the image capture. In one or more embodiments, the image capture criterion of an image frame specifies the lighting condition, the image capture type, and the subject ratio of the image to be captured. The physical configuration includes a target camera location and/or a target object location in the scene to capture the image.

In one or more embodiments, the automated image capture controller (120) is further configured to generate control information to direct the objects (e.g., object A (142 a), object B (142 b)) and/or the camera device (110) to a target camera location and/or a target object location in the scene (140) for image capture. For example, the control information may include, or be used to generate, a visible or audible direction instruction or an electronic control signal. In one or more embodiments, the electronic control signal is a digital data message specifying location or orientation information used by a software application. For example, the digital data message may be transmitted wirelessly. In one or more embodiments, the electronic control signal is an analog electrical signal that triggers hardware to perform a relocating or orienting function. For example, the analog electrical signal may be a wireless signal. When the objects (e.g., object A (142 a), object B (142 b)) and/or the camera device (110) are positioned at the target object location(s) and/or the target camera location, the automated image capture controller (120) sends a signal or command to the camera device (110) to trigger the image capture. In one or more embodiments, the automated image capture controller (120) uses the method described in reference to FIGS. 2.1 and 2.2 below to generate image capture criteria for image frames according to the movie script and to control the camera device to capture the image frames that satisfy corresponding image capture criteria.

Although the system (100) shown in FIG. 1.1 includes only one camera device and one light source, multiple camera devices and multiple light sources may be possible. For example, according to the movie script, multiple camera devices may be configured to capture images of a single object simultaneously from different camera locations according to different lighting conditions, different image capture types, and different subject ratios.

FIG. 1.2 shows additional example details of the system (100) depicted in FIG. 1.1 above. In one or more embodiments, one or more of the modules and elements shown in FIG. 1.2 may be omitted, repeated, and/or substituted. Accordingly, embodiments of the invention should not be considered limited to the specific arrangements of modules shown in FIG. 1.2.

As shown in FIG. 1.2, the system (100) includes essentially the same components as depicted in FIG. 1.1 above with the exception of additional components described below. In particular, the camera device (110) is held by a camera device holder (130) mounted on a moving platform (129). In addition, the light source A (143 a) is shown as a reflective infrared (IR) light source attached to the object A (142 a). A remote light emitter (114) emits a strobe light A (115) that shines on the reflective light source A (143 a) to generate an object reflected light (116). In addition, the light source B (143 b) is a local IR light emitter attached to the object B (142 b) and emitting a strobe light B (117). The object reflected light (116) and strobe light B (117) are IR light rays captured by an IR sensor (111 b) of the camera device (110) via the camera lens (111 a) to generate one or more IR images. Throughout this disclosure, the remote light emitter and local light emitter are referred to as light emitters, and the strobe light may be emitted by the remote light emitter or the local light emitter.

In one or more embodiments of the invention, the camera device (110), automated image capture controller (120), camera device holder (130), and moving platform (129) are communicatively coupled to each other. In one or more embodiments of the invention, two or more of the remote light emitter (114), camera device (110), automated image capture controller (120), camera device holder (130), and moving platform (129) are integrated into a single device. For example, at least a portion of the automated image capture controller (120) may be included in the camera device (110). In another example, at least a portion of the automated image capture controller (120) may be included in the camera device holder (130). In still another example, one part of the automated image capture controller (120) is included in the camera device (110) while another part of the automated image capture controller (120) is included in the camera device holder (130). Similarly, the remote light emitter (114) and/or the light sensing device (112) may be integrated with the camera device (110), automated image capture controller (120), or camera device holder (130).

In one or more embodiments, a light emitter (e.g., the remote light emitter (114) or the local light emitter of the light source B (143 b)) is any device that emits light. For example, the light emitter may emit light across a large angle (e.g., exceeding a 45 degree plane angle, 1 square radian solid angle, etc.) as a flood light emitter. In another example, the light emitter may emit a collimated light beam as a collimated light emitter. The remote light emitter (114) may be separate, e.g., by a certain distance such as 1 meter or more, from the object A (142 a). In one or more embodiments, the light emitter includes a light-emitting-diode (LED). In one or more embodiments, the strobe light (e.g., strobe light A (115), strobe light B (117)) changes intensity and/or wavelength from time to time. For example, the strobe light may produce a free-running light change pattern according to a particular duty cycle (i.e., a percentage of time when the light pattern has a bright level) and repetition rate (i.e., a number of times the intensity changes during a unit time period). As used herein, a light change pattern is a pattern of intensity and/or wavelength change in the light. In one or more embodiments, the light emitter produces a light change pattern with a low repetition rate (e.g., 10 hertz, 20 hertz, etc.) compared to a frame rate of the camera device (110). The frame rate is a number of images (e.g., a burst of still images or a video recording) captured by the camera device (110) during a unit time. In one or more embodiments, the light emitter produces a light change pattern that is synchronized with the frame rate of the camera device (110). In one or more embodiments, the light emitter emits an IR light. In other words, the strobe light has an infrared wavelength, e.g., between 700 nanometers (nm) and 1 millimeter (mm). Throughout this disclosure, the term “infrared wavelength” refers to a wavelength between 700 nm and 1 mm. In one or more embodiments, the light change pattern produced by the strobe light represents encoded digital data. For example, the encoded digital data produced by an infrared strobe light may be similar to an infrared remote control code.
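
As a hedged sketch of the free-running pattern described above (illustrative, not the disclosed implementation), the bright/dark state of a strobe can be modeled from its repetition rate and duty cycle and sampled at the camera frame rate:

    def strobe_state(t_seconds, repetition_rate_hz=10.0, duty_cycle=0.5):
        """Free-running light change pattern: True means the bright level,
        False the dark level, for the given repetition rate and duty cycle."""
        phase = (t_seconds * repetition_rate_hz) % 1.0
        return phase < duty_cycle

    # Sampled at a 30 fps frame rate, a 10 Hz strobe with a 50% duty cycle
    # alternates slowly enough to be resolved across consecutive frames.
    pattern = [strobe_state(i / 30.0) for i in range(9)]
    # -> [True, True, False, True, True, False, True, True, False]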

In one or more embodiments of the invention, the reflective light source A (143 a) is a reflective region of the object A (142 a) that reflects the strobe light A (115) to generate the object reflected light (116). In this context, the reflective light source A (143 a) is said to emit the object reflected light (116). In one or more embodiments, the reflective region has a higher reflectance for infrared wavelength than for visible wavelength. For example, the higher reflectance may be based on reflective material with a higher reflectivity for infrared wavelength than for visible wavelength. While both the strobe light A (115) and ambient light (not shown) shine on the reflective region, the object reflected light (116) may have higher infrared reflected content from the strobe light A (115) than visible reflected content from ambient visible light. In one or more embodiments, the object A (142 a) is a human, animal, robot, or any other moving item, and the reflective light source A (143 a) includes a reflective material attached to the object A (142 a). For example, the reflective material may be part of a wrist band, arm band, belt, finger ring, pendant, necklace, hat, glove, clothing, etc. worn by or otherwise attached to the human, animal, robot, or any other moving item. In one or more embodiments, the reflective material may include metal, dielectric material, or a combination of metal and dielectric material. In one or more embodiments, the reflective material may be a coating layer or painted film on the surface of the aforementioned wrist band, arm band, belt, finger ring, pendant, necklace, hat, glove, clothing, etc. For example, the coating layer or painted film may include infrared reflective pigments such as titanium dioxide. In particular, the titanium dioxide may have a reflectance exceeding 75% for the infrared wavelength.

In one or more embodiments, the reflective material includes a geometric pattern having geometrically varying reflectivity for infrared wavelength to produce a geometric light change pattern. In particular, the geometric pattern of the reflective material produces a spatial variation of the object reflected light that is captured by the camera lens as additional distinction from ambient light. In other words, the geometric pattern enhances the accuracy of detection of the reflective light source. As used herein, a geometric light change pattern is a pattern of intensity change in the light according to the geometric pattern. For example, the geometric pattern may be created by surface coating/painting using the aforementioned infrared reflective pigments such as titanium dioxide. In one or more embodiments, the object reflected light (116) from the reflective light source A (143 a) includes time modulation based on the aforementioned light change pattern originated from the remote light emitter (114) and/or spatial modulation based on the geometric light change pattern of the reflective light source A (143 a).

In one or more embodiments of the invention, the camera device holder (130) is configured to mechanically hold the camera device (110) and to adjust, in response to a control signal from the automated image capture controller (120), the FOV (141) of the camera lens (111 a). For example, the camera device holder (130) may include a motorized tilt-and-swivel platform for adjusting a camera angle of the camera lens (111 a). In another example, the camera device holder (130) may include a motorized horizontal and vertical sliding device for adjusting a position of the camera lens (111 a) relative to the scene (140). The sliding device may include a mechanical stage for holding and moving the camera device (110). Examples of the camera device holder (130) are described in reference to FIGS. 3.1 and 3.2 below.

In one or more embodiments, the automated image capture controller (120) uses the method described in reference to FIG. 2.2 to perform object tracking based on the light sources attached to the objects (e.g., object A (142 a), object B (142 b)). The automated image capture controller (120) is further configured to determine the locations of the objects (e.g., object A (142 a), object B (142 b)) based on the object tracking. As described in reference to FIG. 1.1 above, the angle between the optical axis (111) and the direction (146) from the camera lens (111 a) to the object A (142 a) may be determined based on the position offset (from the center of the image) of the object A (142 a) appearing in the captured IR image. In one or more embodiments, the automated image capture controller (120) is configured to analyze the direction of the optical axis (111) and the angle of the object A (142 a) when the camera device (110) is at different locations. For example, with the camera device (110) moved to different locations, the optical axis (111) may be maintained at a fixed direction by the camera device holder (130) based on control information from the automated image capture controller (120). Accordingly, the location of the object A (142 a) may be determined using triangulation techniques based on the position offsets of the object A (142 a) appearing in the IR images captured from the different camera locations. In one or more embodiments, the automated image capture controller (120) may determine the location of the object A (142 a) using methods other than triangulation. For example, the distance between the object A (142 a) and the camera device (110) may be determined using a laser range finder or an ultrasonic range finder. Accordingly, the automated image capture controller (120) may determine the location of the object A (142 a) using the trilateration technique based on distances measured from multiple camera locations.
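
For concreteness (a sketch under stated assumptions, not the disclosed implementation), the triangulation step can be reduced to intersecting two bearing rays in the ground plane, where each bearing is the fixed optical-axis direction corrected by the angle recovered from the image position offset; all names are illustrative:

    import math

    def triangulate(cam_a, bearing_a, cam_b, bearing_b):
        """Intersect two bearing rays (radians from the +x axis) cast from
        two known camera locations to recover the object location in 2-D."""
        ax, ay = cam_a
        bx, by = cam_b
        dax, day = math.cos(bearing_a), math.sin(bearing_a)
        dbx, dby = math.cos(bearing_b), math.sin(bearing_b)
        denom = dax * dby - day * dbx
        if abs(denom) < 1e-9:
            raise ValueError("bearings are parallel; no unique intersection")
        t = ((bx - ax) * dby - (by - ay) * dbx) / denom
        return (ax + t * dax, ay + t * day)

    # Example: cameras at (0, 0) and (2, 0) sighting the object at
    # 45-degree and 135-degree bearings locate it at (1.0, 1.0).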

In one or more embodiments, the automated image capture controller (120) is configured to generate a region-of-interest for tracking the object A (142 a). In one or more embodiments, the region-of-interest is generated based on a dynamic model of the object A (142 a) and the location of the light source A (143 a) in a captured IR image. For the example where the object A (142 a) is a human, the dynamic model describes mechanical linkages among movable elements (e.g., arm, wrist, hand, head, torso, leg, etc.) of the human body. The dynamic model takes into account possible postures of the human body. For example, considering that the light source A (143 a) is attached to a particular movable element (e.g., wrist) of the human body, a range limitation of other movable elements (e.g., arm, hand, head, torso, leg, etc.) relative to the particular movable element (e.g., wrist) may be determined based on the dynamic model to generate a probability map representing the region-of-interest where the human body may appear in the scene (140). The region-of-interest may facilitate the calculation of an exact structure of the object in a limited area (i.e., within the region-of-interest) in the image, which may reduce the requirement of computing resources significantly. Accordingly, the distance between the object A (142 a) and the camera device (110) to satisfy a particular image capture type (e.g., close-up, half-portrait, full-portrait, and wide-angle) may be determined based on the size of the object appearing in the image and the solid angle of the FOV (141), which is dependent on the focal length and zoom factor of the camera lens (111 a). In one or more embodiments, the automated image capture controller (120) is configured to determine the distance that satisfies the image capture type specified by the movie script based on the region-of-interest and the focal length and zoom factor of the camera lens (111 a). Accordingly, the distance is used in determining the target camera location and/or the target object location based on the current object location and/or current camera location according to the movie script. In one or more embodiments, the automated image capture controller (120) is configured to determine the subject ratio of the object appearing in the image. Accordingly, the subject ratio is used in determining the target camera location and/or the target object location based on the current object location and/or current camera location according to the movie script.
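
One simple reading of this step (an assumption on our part, since the disclosure describes it only at the level of a probability map) is to bound the region-of-interest by the maximum reach of the body parts linked to the marker position:

    def region_of_interest(marker_px, reach_px, image_size_px):
        """Axis-aligned bounding box around the tracked light-source spot,
        sized by the dynamic model's reach limit (e.g., arm span in pixels
        when the marker is worn on the wrist), clipped to the image."""
        x, y = marker_px
        w, h = image_size_px
        return (max(0, x - reach_px), max(0, y - reach_px),
                min(w - 1, x + reach_px), min(h - 1, y + reach_px))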

In one or more embodiments of the invention, the moving platform (129) is a carrier that moves about within and/or beyond the scene (140) according to the control information generated by the automated image capture controller (120). In one or more embodiments, the moving platform (129) is a robot, a motorized cart, or a drone that holds the camera device (110) and is driven by a location control signal to move to the target camera location.

In one or more embodiments, the moving platform (129) is a human user holding the camera device (110). The control information from the automated image capture controller (120) is outputted by the camera device (110) as an instruction directing the human user to move to the target camera location.

In one or more embodiments, the objects (e.g., object A (142 a), object B (142 b)) may include a human object, such as a movie actor. In such embodiments, the control information from the automated image capture controller (120) may include an instruction directing the human object to move to the target object location.

Although the light sources shown in FIG. 1.2 include both a local light emitter and a reflective light source, other configurations may also be possible where only local light emitters or only reflective light sources are used. For example, both light source A (143 a) and light source B (143 b) may be local light emitters. In another example, both light source A (143 a) and light source B (143 b) may be reflective light sources shone by a single remote light emitter (114).

Although the system (100) shown in FIG. 1.2 includes only one camera device and camera device holder, multiple camera devices and multiple camera device holders may be possible. For example, multiple camera devices may be configured to track different objects with different encoded light sources simultaneously without conflict.

FIG. 1.3 shows details of the automated image capture controller (120) in accordance with one or more embodiments. The following description of FIG. 1.3 refers to various components depicted in FIGS. 1.1 and 1.2 above. In one or more embodiments, one or more of the modules and elements shown in FIG. 1.3 may be omitted, repeated, and/or substituted. Accordingly, embodiments of the invention should not be considered limited to the specific arrangements of modules shown in FIG. 1.3.

As shown in FIG. 1.3, the automated image capture controller (120) includes a hardware processor (121), memory (122), and repository (123). In one or more embodiments of the invention, the hardware processor (121) corresponds to the computer processors (702) depicted in FIG. 7.1 below. Similarly, the memory (122) and repository (123) correspond to the non-persistent storage (704) and/or persistent storage (706) depicted in FIG. 7.1 below. For example, the memory (122) may store software instructions that, when executed, cause the hardware processor (121) to perform image capture and object tracking functionalities of the automated image capture controller (120). In one or more embodiments, the automated image capture controller (120) performs various functionalities according to the method flowcharts described in reference to FIGS. 2.1 and 2.2 below. In one or more embodiments, the memory (122) stores instructions to perform one or more portions of the method flowcharts described in reference to FIGS. 2.1 and 2.2 below. In one or more embodiments, the automated image capture controller (120) and the camera device (110) are integrated into a single device. In such embodiments, the instructions to perform one or more portions of the method flowcharts described in reference to FIGS. 2.1 and 2.2 are part of a mobile application, or mobile app, which is a user-installable software application designed to run on a smart phone or other mobile devices.

Further as shown in FIG. 1.3, the repository (123) includes a sequence of IR images (126), a light change pattern (124), a displacement (125), a movement parameter (128), and a target position (127). In particular, the sequence of IR images (126) includes consecutive images (e.g., IR image A (126 a)) captured by the camera device (110). For example, the IR image A (126 a) corresponds to a portion of the scene (140) that is covered by the FOV (141) at a particular time point. The light change pattern (124) is a pattern of light intensity and/or wavelength alternating between different intensity levels and/or wavelengths across the sequence of images (126).

In one or more embodiments, the light change pattern (124) corresponds to a spot in each IR image of the sequence of IR images (126). For example, the spot may be defined by a pixel position or a collection of connected pixel positions in each IR image. In this context, the light change pattern (124) is referred to as a local light change pattern captured by the camera device (110). In one or more embodiments, the light change pattern (124) is caused by a strobe light (e.g., strobe light A (115), strobe light B (117)) and indicates a position of the light source (e.g., light source A (143 a), light source B (143 b)) within each IR image. In other words, the position of the light source (e.g., light source A (143 a), light source B (143 b)) within each IR image may be determined based on where the light change pattern (124) is found across the sequence of IR images (126). For example, the light change pattern (124) indicates that the light source is at the position A (127 a) in the IR image A (126 a). Similarly, each other IR image in the sequence of IR images (126) is associated with a position of the light source. The target position (127) is a pre-determined position that the automated image capture controller (120) is configured for tracking the object (e.g., object A (142 a), object B (142 b)). For example, the target position (127) may be defined as the center of the FOV (141), which corresponds to the center of each IR image of the sequence of IR images (126). In other words, the automated image capture controller (120) is configured to adjust the FOV (141) such that the tracked object appears at the center (i.e., target position (127)) in the IR image after the adjustment. In other examples, the target position (127) may be defined as different positions from the center of the FOV (141). The displacement (125) is the position offset between the target position (127) and the position (e.g., position A (127 a)) of the light source within an IR image. In one or more embodiments, the displacement (125) includes a horizontal distance and a vertical distance. The displacement (125) may be represented based on a number of pixels or any other suitable distance scale. In one or more embodiments, the object may be a moving object such that the position (e.g., position A (127 a)) of the light source may vary from one image to the next in the sequence of images (126). In such embodiments, the movement parameter (128) is a rate of change of the position (e.g., position A (127 a)) of the light source over time. For example, the movement parameter (128) may include a change in the position (e.g., position A (127 a)) of the light source from one image to the next in the sequence of images (126). Depending on the moving direction of the tracked object, the movement parameter (128) may include a horizontal portion and a vertical portion. Mathematically, the movement parameter (128) corresponds to a derivative of the displacement (125) over time.
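
In code, the displacement and movement parameter described above might be computed per frame roughly as follows (a minimal sketch; the names are illustrative):

    def displacement(target_pos, light_pos):
        """Horizontal and vertical offset, in pixels, between the target
        position and the light-source position within one IR image."""
        return (target_pos[0] - light_pos[0], target_pos[1] - light_pos[1])

    def movement_parameter(prev_pos, curr_pos, frame_interval_s):
        """Rate of change of the light-source position between consecutive
        frames, i.e., the time derivative of the displacement."""
        return ((curr_pos[0] - prev_pos[0]) / frame_interval_s,
                (curr_pos[1] - prev_pos[1]) / frame_interval_s)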

In one or more embodiments, the light change pattern (124) includes a light intensity change and/or a light wavelength change. In particular, the light intensity change and/or light wavelength change is associated with a repetition rate of the change. In one or more embodiments, the light intensity change and/or light wavelength change with associated repetition rate defines a digital code. For example, the digital code may include a header and subsequent digital pattern where the header and subsequent digital pattern may be re-occurring within the light change pattern (124). The digital code may be distinct for each light source in the scene (140) and used for identifying the object to which the light source is attached. In this context, the digital code defined by the light intensity change and/or light wavelength change with associated repetition rate of the light change pattern (124) is referred to as an object-identifying code (124 a). In one or more embodiments, the light intensity change and/or light wavelength change are temporal changes produced by the light emitter. In one or more embodiments, the light intensity change and/or light wavelength change further include a spatial change produced by the aforementioned geometric pattern of a reflective light source.
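
A hedged decoding sketch follows; the header bits, code length, and intensity thresholding are our assumptions, since the disclosure states only that a header plus a digital pattern re-occur in the light change pattern:

    HEADER = [1, 0, 1, 1, 0, 0]  # hypothetical re-occurring header pattern

    def decode_object_id(spot_intensities, threshold, code_len=8):
        """Threshold the per-frame intensity of one tracked spot into bits,
        locate the header, and return the digital pattern that follows it."""
        bits = [1 if v > threshold else 0 for v in spot_intensities]
        n = len(HEADER)
        for i in range(len(bits) - n - code_len + 1):
            if bits[i:i + n] == HEADER:
                return bits[i + n:i + n + code_len]
        return None  # header not found in this window of frames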

In one or more embodiments, the automated image capture controller (120) performs the FOV adjustment functionalities based on the sequence of images (126), light change pattern (124), displacement (125), movement parameter (128), and target position (127) described above. Specifically, the automated image capture controller (120) performs the FOV adjustment functionalities using the methods described in reference to FIGS. 2.1 and 2.2 below. An example of the sequence of images (126), light change pattern (124), object-identifying code (124 a), displacement (125), and movement parameter (128) is described in reference to FIGS. 4-6 below.

FIG. 2.1 shows a flowchart in accordance with one or more embodiments. The process shown in FIG. 2.1 may be executed, for example, by one or more components discussed above in reference to FIGS. 1.1, 1.2, and 1.3. One or more steps shown in FIG. 2.1 may be omitted, repeated, and/or performed in a different order among different embodiments of the invention. Accordingly, embodiments of the invention should not be considered limited to the specific number and arrangement of steps shown in FIG. 2.1.

Initially, in Step 241, a direction of a visible light source from a location in the scene is generated. In one or more embodiments of the invention, a light sensing device is placed at one or more locations in the scene to determine the direction of the visible light source from each location. In one or more embodiments, the light sensing device is a visible light sensor embedded in a camera device. With the camera device located at a particular location and the visible light sensor oriented toward various directions, the visible light sensor is used to capture a number of images corresponding to these directions. Each image is a portion of a photo sphere centered at the location of the camera device, or more specifically centered at the location of the visible light sensor. The captured images are analyzed using a hardware processor to select an image where the visible light source appears. If the visible light source appears in multiple images, the image having the visible light source closest to the center of the image is selected. Accordingly, the direction of the visible light sensor corresponding to the selected image is determined as representing the direction of the visible light source, i.e., the lighting direction at the location where the light sensing device is placed.

In one or more embodiments, the visible light source (e.g., natural sun light from the sun) is sufficiently far away from the scene such that the lighting directions everywhere in the scene are parallel to each other. In such embodiments, the lighting direction throughout the scene may be determined as described above by placing the light sensing device at any single location in the scene. In one or more embodiments, the visible light source is near or within the scene such that the lighting directions in the scene are location dependent. In such embodiments, the light sensing device may be placed at two or more locations in the scene where two or more lighting directions are determined as described above. The location of the visible light source may be mathematically derived using triangulation based on the two or more directly determined lighting directions. Accordingly, the lighting direction at any particular location in the scene is the direction from the mathematically derived location of the visible light source to the particular location in the scene. In other words, the lighting direction at the particular location may be determined without placing the light sensing device at the particular location.
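
A minimal sketch of the selection rule in Step 241 (illustrative names; each sample pairs a sensor orientation with the light source's pixel offset from the image center). For a nearby source, two such directions measured at two sensor locations can be fed to the same ray-intersection routine sketched after the triangulation discussion above:

    def lighting_direction(samples):
        """samples: list of (sensor_direction, offset_from_center_px) pairs,
        one per captured image in which the visible light source appears.
        Returns the sensor direction whose image best centers the source."""
        direction, _ = min(samples, key=lambda s: s[1])
        return direction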

In Step 242, a movie script is analyzed to determine an image capture criterion of an image in a movie. In one or more embodiments of the invention, the image capture criterion specifies a lighting condition and an image capture type of an object in the image. Accordingly, the movie script is analyzed to determine one of front lighting, side lighting, and back lighting of the object in the particular image. In addition, the movie script is analyzed to determine one of close-up, half-portrait, full-portrait, and wide-angle image capture type for the particular image. In one or more embodiments, the movie script is in a machine readable format and is analyzed by a computer processor. The target position and subject ratio in the particular image are also determined for each image capture type based on the movie script. In one or more embodiments, the object in the scene includes multiple actors such as a main character and a supporting character. In one or more embodiments, the actors may be human, animal, robot, or any other moving item. In such embodiments, the movie script is further analyzed to identify whether the lighting condition and image capture type pertain to the main character or the supporting character.

In Step 243, a target camera location and/or a target object location is generated based on the lighting direction of the scene and the specified lighting condition and image capture type for the particular image. In one or more embodiments, the object is stationary in the particular image and the target camera location is generated to satisfy the specified lighting condition and image capture type for the particular image. For example, the target camera location may be determined to be within a certain angular sector with respect to the stationary object location based on the specified lighting condition. Further, the target camera location may be determined to be within a certain radial distance range with respect to the stationary object location based on the specified subject ratio. Accordingly, the target camera location may be determined based on the intersection of the angular sector and the radial distance range determined above. The target camera location may be further determined based on additional constraints, such as to minimize the distance between the current camera location and the target camera location or to avoid obstruction between the target camera location and the stationary object location. In one or more embodiments, the stationary object location is determined using the method described in reference to FIG. 2.2 below.

In one or more embodiments, the camera device is stationary at a pre-determined location for the particular image and the target object location is generated to satisfy the specified lighting condition and image capture type for the particular image. For example, the target object location may be determined to be within a certain angular sector with respect to the stationary camera location based on the specified lighting condition. Further, the target object location may be determined to be within a certain radial distance range with respect to the stationary camera location based on the specified subject ratio. Accordingly, the target object location may be determined based on the intersection of the angular sector and the radial distance range determined above. The target object location may be further determined based on additional constraints, such as to minimize the distance between the current object location and the target object location or to avoid obstruction between the target object location and the stationary camera location. In one or more embodiments, the current object location is determined using the method described in reference to FIG. 2.2 below.
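
One way to realize the intersection described in Step 243 (a sketch under stated assumptions, reusing the lighting_angle_deg helper from the earlier lighting-condition sketch; the sampling density and names are illustrative) is to scan candidate positions on rings around the stationary object and keep the admissible candidate nearest the current camera location:

    import math

    def target_camera_location(obj_pos, light_dir, current_cam,
                               angle_range_deg, dist_range, samples=360):
        """Scan candidate camera positions on rings around the stationary
        object; keep those whose lighting angle falls within the desired
        angular sector and whose distance falls within the desired radial
        range, then return the admissible candidate nearest the current
        camera location (the distance-minimizing constraint above)."""
        lo_a, hi_a = angle_range_deg
        lo_r, hi_r = dist_range
        best, best_d = None, float("inf")
        for r in (lo_r, (lo_r + hi_r) / 2.0, hi_r):
            for k in range(samples):
                theta = 2.0 * math.pi * k / samples
                cam = (obj_pos[0] + r * math.cos(theta),
                       obj_pos[1] + r * math.sin(theta))
                cam_to_obj = (obj_pos[0] - cam[0], obj_pos[1] - cam[1])
                angle = lighting_angle_deg(light_dir, cam_to_obj)
                if lo_a <= angle <= hi_a:
                    d = math.dist(cam, current_cam)
                    if d < best_d:
                        best, best_d = cam, d
        return best  # None if the sector and range do not intersect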

In one or more embodiments, the object in the scene includes multiple actors such as a main character and a supporting character. In one or more embodiments, the actors may be human, animal, robot, or any other moving item. In such embodiments, the target camera location and/or the target object location is generated pertaining to either the main character or the supporting character for the particular image according to the movie script. In one or more embodiments, the main character or the supporting character is tracked using the method described in reference to FIG. 2.2 below.

In Step 244, the camera device and/or the object is directed to the corresponding target location determined in Step 243 above. For example, directing the camera device and/or the object to the corresponding target location may be by way of automatically controlling a robotic platform or by automatically generating audible/visible instructions to a human holding the camera device and/or the object. In one or more embodiments, the object is stationary in the particular image and the camera device is directed to the target camera location to satisfy the specified lighting condition and image capture type for the particular image. In one or more embodiments, the camera device is disposed on a moving platform. In such embodiments, a location control signal is generated to direct the moving platform to move to the target camera location.

In one or more embodiments, the moving platform is a robotic platform holding the camera device and driven by the location control signal to move to the target camera location. In one or more embodiments, the moving platform is a human user holding the camera device, where the location control signal causes the camera device to output an instruction directing the human user to move to the target camera location.

In one or more embodiments, the camera device is stationary in the particular image and the object is directed to the target object location to satisfy the specified lighting condition and image capture type for the particular image. In one or more embodiments, the object is a human actor and an instruction is generated directing the human actor to move to the target object location. For example, the instruction may be a visible message or an audio message.

In Step 245, the image capture criterion is verified upon the camera device and/or the object arriving at the corresponding target location. In one or more embodiments, a measure of lighting of the object is captured using the light sensing device to confirm that the specified lighting condition is satisfied. For example, the measure of lighting may include the lighting direction and/or contrast of the object measured using the light sensing device at the location of the camera device. In one or more embodiments, verifying the image capture criterion is optional. In such embodiments, verifying the image capture criterion may not be performed.

In Step 246, an image of the object in the scene is caused to be captured by the camera device based on the verified image capture criterion. For example, a command may be transmitted to the camera device to capture the image. As noted above, in one or more embodiments, verifying the image capture criterion is optional, and an image of the object in the scene may be captured by the camera device without verifying the image capture criterion. In one or more embodiments, the image of the object is captured using the method described in reference to FIG. 2.2 below.

As noted above, in one or more embodiments, the light sensing device is the sole image sensor of the camera device. In such embodiments, the light sensing device used to determine the lighting direction and verify the image capture criterion is the same image sensor that is used to capture the image of the object.

In one or more embodiments, control information is generated to change a field-of-view of the camera device to capture the image such that the object appears in the image to substantially align with a target position within the field-of-view of the camera device. In one or more embodiments, the control information includes camera orientation and/or zoom factor. For example, the control information may be used to further generate an orientation control signal to orient the field-of-view of the camera device toward the object from the target camera location. In another example, the control information may be used to further generate a zoom control signal to adjust the zoom factor of the camera device.
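
A sketch of how such control information might be derived is shown below; the linear pixel-to-angle mapping and the subject-ratio-driven zoom are simplifying assumptions, and the function name is hypothetical.

    def fov_control_info(displacement_px, image_size_px, fov_deg,
                         subject_ratio, target_subject_ratio):
        """Turn the pixel displacement of the tracked object into camera
        orientation and zoom control information (a sketch; a linear
        pixel-to-angle mapping is assumed)."""
        dx, dy = displacement_px
        w, h = image_size_px
        pan_deg = -dx / w * fov_deg    # orient toward the object: opposite
        tilt_deg = -dy / h * fov_deg   # direction to the displacement
        # Zoom so the object's subject ratio approaches the target ratio.
        zoom_factor = target_subject_ratio / max(subject_ratio, 1e-6)
        return {"pan_deg": pan_deg, "tilt_deg": tilt_deg, "zoom": zoom_factor}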

In one or more embodiments, control information is generated to change a crop field of the image such that the object appears in the image to substantially align with a target position within the crop field of the image. In one or more embodiments, the control information specifies a cropping region within the image. For example, the control information may be used during the post-production phase of movie making.
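
For example, a cropping region that places the object at the target position within the crop field could be computed as in the following sketch; the clamping logic, which keeps the crop inside the full image, is an assumption for illustration.

    def crop_region(object_xy_px, target_xy_px, crop_size_px, image_size_px):
        """Place a crop window so the object lands at the target position
        inside the crop field (assumes the crop fits within the image)."""
        cw, ch = crop_size_px
        left = min(max(object_xy_px[0] - target_xy_px[0], 0), image_size_px[0] - cw)
        top = min(max(object_xy_px[1] - target_xy_px[1], 0), image_size_px[1] - ch)
        return (int(left), int(top), int(left + cw), int(top + ch))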

In Step 247, a determination is made as to whether another image is to be captured according to the movie script. If the determination indicates to continue capturing another image, the method returns to Step 242. If the determination indicates that no more images are to be captured, the method ends.

FIG. 2.2 shows a flowchart in accordance with one or more embodiments. The process shown in FIG. 2.2 may be executed, for example, by one or more components discussed above in reference to FIGS. 1.1, 1.2, and 1.3. One or more steps shown in FIG. 2.2 may be omitted, repeated, and/or performed in a different order among different embodiments of the invention. Accordingly, embodiments of the invention should not be considered limited to the specific number and arrangement of steps shown in FIG. 2.2.

Initially, in Step 251, a light source within a scene is activated. In one or more embodiments of the invention, the light source is a reflective region attached to an object in the scene. In such embodiments, the reflective light source is activated by using a remote light emitter to emit and project a strobe light onto the reflective region. For example, the strobe light is emitted with a free-running light pattern when the remote light emitter is turned on. As a result, the strobe light is reflected by the reflective region to generate an object reflected light having the same free-running light pattern. In one or more embodiments of the invention, the light source is a local light emitter attached to an object in the scene. In such embodiments, the light source is activated by activating the local light emitter to emit a strobe light. For example, the strobe light is emitted with a free-running light pattern when the local light emitter is turned on.

In one or more embodiments, the strobe light and the object reflected light have a low repetition rate (e.g., 10 hertz, 20 hertz, etc.) compared to a frame rate of a camera device. In one or more embodiments, the strobe light and the object reflected light are synchronized with the frame rate of the camera device. For example, the strobe light may be initiated and/or synchronized based on a trigger signal sent from a tracking controller and/or the camera device. In one or more embodiments, intensity and/or wavelength of the strobe light and/or the object reflected light are changed with associated repetition rate(s) to define the object-identifying code.

In Step 252, a sequence of images of the scene is captured by a camera device. In particular, the object is within the field-of-view (FOV) of the camera lens and appears in the sequence of images. For example, the sequence of images may include or be part of a burst of still images. In another example, the sequence of images may include or be part of a video recording. In one or more embodiments, the sequence of images of the scene is captured while the light source emits the object reflected light or strobe light. In one or more embodiments, the frame rate of the sequence of images is selected based on the duty cycle and/or repetition rate of the light source such that consecutive images (or a pair of images with a particular separation in the sequence) include alternating bright level and dark level, and/or alternating wavelengths from the light emitter. For example, the remote or local light emitter may be free-running and the frame rate is selected based on the duty cycle and/or repetition rate of the free-running light source. In one or more embodiments, the duty cycle and/or repetition rate of the light emitter is selected based on the frame rate of the sequence of images such that consecutive images (or a pair of images with a particular separation in the sequence) include alternating bright level and dark level, and/or alternating wavelengths from the light emitter. For example, the frame rate may be pre-determined and the light emitter is synchronized to the frame rate, e.g., based on a trigger signal from the camera device.
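
The frame-rate selection can be illustrated with a small helper; the 50% duty-cycle assumption and the function name are illustrative only.

    def pick_frame_rate(strobe_hz, separation=1):
        """Choose a camera frame rate so that two images `separation` frames
        apart sample opposite phases of a 50%-duty-cycle strobe, yielding the
        alternating bright/dark levels described above (a sketch, not the
        only workable choice)."""
        # Compared frames must be half a strobe period apart in time:
        #   separation / frame_rate = 0.5 / strobe_hz
        return 2.0 * strobe_hz * separation

    # Example: a free-running 10 Hz strobe compared across consecutive
    # frames (separation=1) calls for a 20 frames-per-second capture rate.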

In Step 253, based on a local light change pattern across the sequence of images, the light source is detected in the scene. Specifically, the object reflected light or strobe light from the light source causes changes in light intensity and/or wavelength received by an optical sensor of the camera device, resulting in the local light change pattern across the sequence of images. In one or more embodiments, the intensity of the light source is adjusted to control the size of the region where the local light change pattern is found in each image. For example, the size may be limited to a percentage (e.g., 1%, 3%, etc.) of the horizontal and vertical dimensions of the FOV. In one or more embodiments, the position and the size of the detected light source are defined by where the difference in alternating bright level and dark level, and/or alternating wavelengths, in consecutive images, as recognized by the optical sensor of the camera device, exceeds a pre-determined threshold.

In one or more embodiments, a pair of images in the sequence of images are compared by subtraction of intensity and/or wavelength values of corresponding pixels. Specifically, the intensity and/or wavelength values are generated by the optical sensor. For example, the intensity values may correspond to pixel output values of a monochrome CMOS (complementary metal oxide semiconductor) sensor. In another example, output values of an RGB CMOS sensor may be analyzed to determine the wavelength value of each pixel. In particular, the intensity and/or wavelength value of a pixel in one image is subtracted from the intensity and/or wavelength value of the corresponding pixel in another image to generate a subtraction result. The pixel where the difference in alternating bright level and dark level, and/or alternating wavelengths, is found in the subtraction result is selected as part of the detected light source in the image. Depending on the duty cycle/repetition rate of the light source versus the frame rate of the sequence of images, the pair of images may be consecutive images or two images separated by a particular number of images, such as every three images, etc.
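
A minimal NumPy rendering of this subtraction, assuming 8-bit monochrome frames and a hypothetical threshold value, is:

    import numpy as np

    def detect_light_source(frame_a, frame_b, threshold=64):
        """Locate the blinking light source by pixel-wise subtraction of two
        frames sampled in opposite strobe phases (a sketch; frames are 2-D
        uint8 intensity arrays from a monochrome sensor)."""
        diff = np.abs(frame_a.astype(np.int16) - frame_b.astype(np.int16))
        mask = diff > threshold            # pixels that flipped bright<->dark
        if not mask.any():
            return None
        ys, xs = np.nonzero(mask)
        # Centroid and extent of the changed region give position and size.
        return {"position": (xs.mean(), ys.mean()),
                "size": (xs.max() - xs.min() + 1, ys.max() - ys.min() + 1)}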

In one or more embodiments, an object-identifying code is extracted from the local light change pattern to identify the light source from multiple light sources within the scene. In one or more embodiments, the local light change pattern is analyzed to detect a pre-determined header pattern. Once detected, the pattern following the pre-determined header pattern is extracted as the distinct code identifying a particular light source or object. In one or more embodiments, the distinct code has a pre-determined length or number of digital bits that is used to delimit the object-identifying code. In other embodiments, the object-identifying code may be delimited based on other criteria.
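
The fixed-length variant of this extraction can be sketched as follows; the 8-bit all-zero header and the 16-bit code length mirror the example of FIG. 4 but are otherwise assumptions.

    def extract_code(bits, header=(0,) * 8, code_len=16):
        """Scan a decoded bit stream for the pre-determined header pattern and
        return the fixed-length object-identifying code that follows it.
        `bits` is a list of 0/1 integers decoded from per-image light levels."""
        h = list(header)
        for i in range(len(bits) - len(h) - code_len + 1):
            if bits[i:i + len(h)] == h:
                # The code is delimited by its pre-determined length.
                return bits[i + len(h): i + len(h) + code_len]
        return None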

In one or more embodiments, multiple objects (e.g., a main character and a supporting character according to the movie script) within the scene are tracked concurrently, where each object is attached with an individual light source having a distinct object-identifying code. In other words, multiple light change patterns are found at multiple locations across the sequence of images, where each light change pattern includes a distinct object-identifying code that is different from the object-identifying code of any other light change pattern. Accordingly, each light source is identified as distinct from other light sources based on respective light change patterns. Because each light source is uniquely associated with the object to which it is attached, each object is tracked individually across the sequence of images based on respective object-identifying codes.

In one or more embodiments, multiple light sources are detected and identified by iterating Steps 252 through 254. For example, each iteration may be based on a particular object-identifying code specified by a user input. In one or more embodiments, an image from the sequence of images is presented in a user interface window where a user may select an object by clicking or otherwise selecting one of multiple detected light sources. Once selected, the object-identifying code of the selected light source is used to determine the location of the selected light source corresponding to the selected object. Accordingly, the selected object is tracked for image capturing in Steps 255 through 259. From time to time, the user may select a different object using the user interface. Once the tracked object is switched to a different object, a different object-identifying code of the newly selected light source is used to determine the location of the newly selected light source corresponding to the newly selected object. Accordingly, the newly selected object is tracked for image capturing in Steps 255 through 259.

In Step 254, the sequence of images is analyzed to determine a position of the detected and identified light source in at least one image, and optionally a movement of the light source across the sequence of images. In one or more embodiments, the position of the light source is determined based on where the difference in alternating bright level and dark level, and/or alternating wavelengths, in the sequence of images, as recognized by the optical sensor of the camera device, exceeds the pre-determined threshold. In one or more embodiments, the movement of the light source is determined based on a rate of change of the position over the sequence of images.

In Step 255, a determination is made as to whether to change the camera location. In one or more embodiments, the camera location is changed to perform triangulation to determine the location of the object in the scene. If the determination is positive, i.e., the camera location is to be changed for performing triangulation, the method returns to Step 252 with the camera device moved to a different location. If the determination is negative, i.e., the camera location is not to be changed, the method proceeds to Step 256.

In Step 256, the position of the light source is analyzed to generate a result. In one or more embodiments, the position of the light source and a target position within an image are compared to generate the result. In one or more embodiments, the result includes the displacement between the light source position and the target position. In one or more embodiments, the displacement may vary from one image to the next in the sequence of images, indicating that the object is a moving object. In such embodiments, the rate of change of the displacement over time, e.g., from one image to the next, is computed as a movement parameter. In one or more embodiments, the displacement between the light source position and the target position, and optionally the movement parameter, are used to generate the control information for object tracking during image capture.
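
Expressed as code, the result of this analysis might be computed as below; the last two detected positions and the frame interval are hypothetical inputs to this sketch.

    def displacement_and_movement(positions_px, target_px, frame_dt_s):
        """Compute the displacement of the latest light-source position from
        the target position, plus a movement parameter (rate of change of
        the position between the last two frames); an illustrative sketch."""
        (x0, y0), (x1, y1) = positions_px[-2], positions_px[-1]
        disp = (x1 - target_px[0], y1 - target_px[1])
        movement = ((x1 - x0) / frame_dt_s, (y1 - y0) / frame_dt_s)
        return disp, movement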

In one or more embodiments where triangulation is performed, the position offset (from the center of the image) of the light source appearing in the image is proportional to the angle between the optical axis of the camera device and the direction from the camera lens to the object in the scene. Based on the position offsets of the light source appearing in two or more images captured by the camera device at different locations, the location of the object in the scene is determined using a triangulation survey technique. In one or more embodiments, satisfying the lighting condition and image capture type specified in the movie script is based on determining the location of the object in the scene as described above.
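
A two-view version of this computation is sketched below; the linear pixel-to-angle approximation and the 2-D ray intersection are a textbook rendering of triangulation, not necessarily the survey technique used in a given embodiment, and all parameter names are assumptions.

    import math

    def triangulate(cam_a, cam_b, yaw_a_deg, yaw_b_deg,
                    offset_a_px, offset_b_px, fov_deg, width_px):
        """Recover the object's 2-D scene location from the position offsets
        of its light source in images taken at two known camera locations."""
        def bearing(yaw_deg, offset_px):
            # Offset from the image center maps (approximately linearly) to
            # an angle off the optical axis, as described above.
            return math.radians(yaw_deg + offset_px / width_px * fov_deg)

        ta, tb = bearing(yaw_a_deg, offset_a_px), bearing(yaw_b_deg, offset_b_px)
        # Intersect the two bearing rays: cam + t * (cos, sin).
        dax, day = math.cos(ta), math.sin(ta)
        dbx, dby = math.cos(tb), math.sin(tb)
        denom = dax * dby - day * dbx
        if abs(denom) < 1e-9:
            return None  # parallel rays: no unique intersection
        t = ((cam_b[0] - cam_a[0]) * dby - (cam_b[1] - cam_a[1]) * dbx) / denom
        return (cam_a[0] + t * dax, cam_a[1] + t * day)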

In Step 257, control information is generated based on the result. In one or more embodiments, control information is generated to change a field-of-view of the camera device to capture the image such that the object appears in the image to substantially align with a target position within the field-of-view of the camera device. In one or more embodiments, the control information includes camera orientation and/or zoom factor. For example, the control information may be used to further generate an orientation control signal to orient the field-of-view of the camera device toward the object from the target camera location. In another example, the control information may be used to further generate a zoom control signal to adjust the zoom factor of the camera device.

In one or more embodiments, control information is generated to change a crop field of the image such that the object appears in the image to substantially align with a target position within the crop field of the image. In one or more embodiments, the control information specifies a cropping region within the image. For example, the control information may be used during the post-production phase of movie making.

In Step 258, a control signal is sent to a camera device holder (e.g., a camera handheld grip, a tilt-and-swivel device, etc.) where the camera device is mounted. Accordingly, the orientation of the camera lens or a relative position of the camera device is adjusted in the opposite direction to the displacement.

In one or more embodiments, the control signal is generated based on the control information of Step 257. In one or more embodiments, the control signal is configured to adjust the orientation of the camera lens in the opposite direction to the displacement. In one or more embodiments, the control signal is configured to adjust the relative position of the camera with respect to the scene in the opposite direction to the displacement. In one or more embodiments, the movement parameter is considered in fine-tuning the amount of adjustment caused by the control signal.
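
The opposite-direction adjustment, with the movement parameter used as a feed-forward fine-tuning term for a moving object, might be rendered as follows; the lead time, the linear gain, and the signal format are assumptions of this sketch.

    def orientation_control_signal(disp_px, movement_px_s, image_size_px,
                                   fov_deg, lead_s=0.1):
        """Generate pan/tilt commands that move the lens opposite to the
        measured displacement, with the movement parameter adding a small
        lead so a moving object stays aligned (illustrative values)."""
        px_to_deg = fov_deg / image_size_px[0]
        pan_deg = -(disp_px[0] + movement_px_s[0] * lead_s) * px_to_deg
        tilt_deg = -(disp_px[1] + movement_px_s[1] * lead_s) * px_to_deg
        return {"pan_deg": pan_deg, "tilt_deg": tilt_deg}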

In Step 259, a substantial alignment between the target position and the light source is detected within the FOV of the camera device. In particular, the substantial alignment is a result of adjusting the orientation of the camera lens or a relative position of the camera device in the opposite direction to the displacement.

In Step 260, in response to detecting the substantial alignment, an image of the scene is captured according to the movie script. In one or more embodiments, a signal or command is sent to the camera device to trigger the image capture. In one or more embodiments, consecutive images are continuously captured and outputted by the camera device at a regular repetition rate (i.e., frame rate). In such embodiments, the sequence of images that is analyzed to generate the control signal is limited to a rolling time window (e.g., a rolling sequence of 2 consecutive images, 5 consecutive images, 10 consecutive images, etc.) that precedes the image captured according to the movie script.

In one or more embodiments, the sequence of images that is analyzed to generate the control signal is designated as control information without being outputted by the camera device. In contrast, the image where the light source (hence the object) substantially aligns with the target position is outputted by the camera device to be part of the movie. For example, the control information may be stored separately from the movie images until being discarded or otherwise removed from the camera device.

In Step 261, a determination is made as to whether image capturing is to continue at the current camera location. If the determination is positive, i.e., the image capturing is to continue with the camera at the current camera location, the method proceeds to Step 252. If the determination is negative, i.e., the image capturing is not to continue, the method ends.

FIGS. 3.1, 3.2, 4, 5, and 6 show various examples in accordance with one or more embodiments of the invention. The examples shown in FIGS. 3.1, 3.2, 4, 5, and 6 may be, for example, based on one or more components depicted in FIGS. 1.1, 1.2, and 1.3 above and the method flowcharts depicted in FIGS. 2.1 and 2.2 above. In one or more embodiments, one or more of the modules and elements shown in FIGS. 3.1, 3.2, 4, 5, and 6 may be omitted, repeated, and/or substituted. Accordingly, embodiments of the invention should not be considered limited to the specific arrangements of modules shown in FIGS. 3.1, 3.2, 4, 5, and 6.

FIG. 3.1 shows a camera mobile device handheld grip (800) as an example of the camera device holder (130) depicted in FIG. 1.2 above. In addition, a camera mobile device (201) (e.g., a smart phone having a camera lens (220)), mechanically held by the camera mobile device handheld grip (800), is an example of the camera device (110) depicted in FIG. 1.2 above. Further, the camera lens (220) is an example of the camera lens (111a) depicted in FIG. 1.2 above. In one or more embodiments, the IR sensor (112b) and the light sensing device (112) depicted in FIG. 1.2 above are integrated with a CMOS or CCD sensing element associated with the camera lens (111a) for taking photographs and/or video recordings by the camera device (110). Correspondingly, for taking photographs and/or video recordings by the camera mobile device (201), the camera lens (220) is associated with a CMOS or CCD sensing element that is an example of the integrated IR sensor (112b) and light sensing device (112) depicted in FIG. 1.2 above.

In one or more embodiments of the invention, the camera mobile device handheld grip (800) is an electro-mechanical assembly that includes a holder (221), a tilting shaft (203), a tilting motor (213), a rotating shaft (209), a rotating motor (219), and a handheld grip (222). The holder (221) is configured to mechanically hold the camera mobile device (201) and mechanically couple to the tilting shaft (203). The handheld grip (222) is configured to maintain, while being handheld by a viewer, mechanical stability of the camera mobile device handheld grip (800). Although not explicitly shown, the handheld grip (222) includes a communication interface configured to communicate with the camera device (110) and/or the automated image capture controller (120) depicted in FIG. 1.2 above. For example, the communication interface may be based on Bluetooth, NFC, USB, or other wireless/wired communication interfaces. In one or more embodiments, the rotating shaft (209) is rotatable around a rotating axis (209-1) by the rotating motor (219) in response to a control signal received from the automated image capture controller (120) via the communication interface. Similarly, the tilting shaft (203) is rotatable by the tilting motor (213) around a tilting axis (203-1) in response to the control signal received from the automated image capture controller (120) via the communication interface. In response to tilting the holder (221) around the tilting axis (203-1) and/or rotating the holder (221), collectively with the tilting shaft (203) and tilting motor (213), around the rotating axis (209-1), the orientation of the camera lens (220) may be adjusted. Accordingly, the FOV (220-1) of the camera lens (220) is adjusted according to the orientation of the camera lens (220). Although the example shown in FIG. 3.1 is based on two motors associated with two mechanical shafts, other examples may be based on three motors associated with three mechanical shafts without departing from the scope of the invention, wherein the third motor may be an additional rotating motor, such as the additional rotating motor (331) with the additional rotating axis (209-2) shown in FIG. 3.2. Specifically, FIG. 3.2 shows a camera mobile device handheld grip (800) with three motors as an example of the camera device holder (130) depicted in FIG. 1.2 above.

FIG. 4 shows an example of the light change pattern (124) of the light source (e.g., light source A (143a), light source B (143b)) depicted in FIGS. 1.1 and 1.2 above. As shown in FIG. 4, the horizontal axis corresponds to time and the vertical axis corresponds to light intensity. In particular, the light change pattern (124) is a pattern of light intensity alternating between a bright level (400a) and a dark level (400b) over time. For example, the bright level (400a) of the light intensity sustains over a time period A (410) and may be recurring over time with a certain repetition rate. While the light intensity alternates between the bright level (400a) and the dark level (400b) over time, a sequence of IR images is captured by a camera device periodically. For example, consecutive IR images in the sequence may be captured at a time point A (401a), time point B (401b), time point C (401c), etc. that are separated from each other by a time period B (420), time period C (430), etc. In particular, the time period A (410) encompasses at least one image capture time point, such as the time point B (401b). The alternating sequence of dark level (400b) captured at time point A (401a), bright level (400a) captured at time point B (401b), dark level (400b) captured at time point C (401c), etc. forms the aforementioned local light change pattern captured by the camera device. Although the light change pattern (124) depicted in FIG. 4 is a pattern of light intensity changes, the light change pattern (124) may also include wavelength changes in other examples. In other words, the bright level (400a) and dark level (400b) may be substituted or supplemented by different wavelengths to represent wavelength changes.

The light change pattern (124) depicted in FIG. 4 may be extended along the time axis across a sequence of time points and IR images to define an object-identifying code. For example, the object-identifying code A (402a) and object-identifying code B (402b) are shown in FIG. 4 below the light change pattern (124) using a different time scale. In one or more embodiments, the light intensity level and/or wavelength value in each IR image defines a digital data bit. In other embodiments, the light intensity level and/or wavelength value is constant across each of a number of recurring sets of IR images, where each IR image set corresponds to a digital data bit. In other words, a digital data bit may correspond to a single IR image or an IR image set. In each of the object-identifying code A (402a) and object-identifying code B (402b), a distinct digital data bit pattern is delimited by a header (401) and a trailer (403). For example, the header (401) and trailer (403) may each contain 8 consecutive “zero” digital data bits. Inserted between the header (401) and trailer (403), the object-identifying code A (402a) includes a digital data bit pattern of “1010101010101010” while the object-identifying code B (402b) includes a digital data bit pattern of “1010010101011010”. Accordingly, the digital data bit pattern of “1010101010101010” and the digital data bit pattern of “1010010101011010” are used to identify or select two distinct light sources attached to two distinct objects within the scene (140) depicted in FIGS. 1.1 and 1.2 above.
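
To make the framing concrete, the short sketch below decodes a per-image bright/dark sequence into the object-identifying code bounded by the 8-zero header and trailer of FIG. 4; the one-image-per-bit assumption follows the first embodiment described above.

    def decode_bits(frames_bright):
        """Map per-frame bright/dark levels to digital data bits and return
        the object-identifying code found between the 8-zero header and the
        8-zero trailer (a sketch for the trailer-delimited variant)."""
        bits = [1 if b else 0 for b in frames_bright]
        zeros = [0] * 8
        for i in range(len(bits) - 8):
            if bits[i:i + 8] == zeros and any(bits[i + 8:]):
                start = i + 8
                # The code runs until the 8-zero trailer begins.
                for j in range(start, len(bits) - 7):
                    if bits[j:j + 8] == zeros:
                        return bits[start:j]
        return None

    # Example: header + "1010101010101010" + trailer identifies light source A.
    stream = [0] * 8 + [1, 0] * 8 + [0] * 8
    assert decode_bits(stream) == [1, 0] * 8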

FIG. 5 shows an example of the sequence of IR images (126) of the scene (140) depicted in FIGS. 1.1 and 1.2 above. As shown in FIG. 5, the sequence of IR images (126) includes the IR image A (126a), IR image B (126b), IR image C (126c), etc. that are captured at the time point A (401a), time point B (401b), time point C (401c), etc. depicted in FIG. 4 above. According to the example of the light change pattern (124) described in reference to FIG. 4 above, the light source (e.g., light source A (143a), light source B (143b)) appears as an alternating dark and bright spot at a location marked “a1” or a location marked “a2” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. In contrast, the light intensity remains substantially constant at another location marked “b” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. For example, the location marked “a1” may be determined by subtracting intensity values of corresponding pixels in the IR image A (126a) and IR image B (126b) to generate the subtraction result (126d). Similarly, the location marked “a1” may be further determined by subtracting intensity values of corresponding pixels in the IR image B (126b) and IR image C (126c) to generate the subtraction result (126d). In the subtraction result (126d), black color indicates no difference and white color indicates a non-zero difference or a difference exceeding the aforementioned pre-determined threshold. Accordingly, the position of the light source (e.g., light source A (143a)) corresponds to the white spot in the subtraction result (126d). In another example, the location marked “a2” may be determined in a similar manner to detect the location of a different light source (e.g., light source B (143b)) within the IR images.

Further as shown in FIG. 5, the center of each IR image is defined as the target position (127). Accordingly, the position offset or distance from the location marked “a1” to the target position (127) corresponds to the displacement (125). The location marked “a1”, the target position (127), and the displacement (125) shown in FIG. 5 are examples of the location A (126b), target position (127), and displacement (125), respectively, depicted in FIG. 1.3 above. In one or more embodiments, the location marked “a1” varies between the IR image A (126a), IR image B (126b), IR image C (126c), etc. The rate of change of the location marked “a1” across the IR image A (126a), IR image B (126b), IR image C (126c), etc. corresponds to the movement parameter (128) depicted in FIG. 1.3 above. Although not explicitly shown, the displacement (125) and/or movement parameter (128) may also correspond to the location marked “a2” in a different example.

FIG. 6 shows an example of the sequence of IR images (126) described in reference to FIG. 4 above. In an example scenario, the target position is the center of the IR image. As shown in FIG. 6, when the object-identifying code A (402a) depicted in FIG. 4 above is used for object tracking, the light source A (143a) is identified at a location in the left portion of the IR images (e.g., IR image A (126a)) in the sequence of IR images (126). In particular, the light source A (143a) is reflective material included in a finger ring or part of a wrist band worn by a male person (i.e., object A (142a) as a main character). For example, the position of the light source A (143a) is identified based on the alternating dark and bright spot “a1” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. depicted in FIG. 5 above. In particular, the alternating dark and bright spot “a1” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. exhibits temporal and/or spatial variation that defines the object-identifying code A (402a) associated with the light source A (143a). Because the target position (i.e., image center) is to the right of the light source location, the automated image capture controller (120) is configured to orient the camera device (110) toward the left such that the male person (i.e., object A (142a) as the main character) holding/wearing the light source A (143a) appears in the center of the IR image. Accordingly, using the object-identifying code A (402a), the orientation of the camera device (110) is adjusted based on the identified location “a1” of the light source A (143a) such that the object A (142a) appears in the center of the IR image X (126x).

Further as shown in FIG. 6, when the object-identifying code B (402b) depicted in FIG. 4 above is used for object tracking, the light source B (143b) is identified at a location in the left portion of the IR images (e.g., IR image A (126a)) in the sequence of IR images (126). In particular, the light source B (143b) is a finger ring or part of a wrist band worn by a female person (i.e., object B (142b) as a supporting character). For example, the position of the light source B (143b) is identified based on the alternating dark and bright spot “a2” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. depicted in FIG. 5 above. In particular, the alternating dark and bright spot “a2” in the IR image A (126a), IR image B (126b), IR image C (126c), etc. exhibits temporal and/or spatial variation that defines the object-identifying code B (402b) associated with the light source B (143b). Because the target position (i.e., image center) is to the right of the light source location, the automated image capture controller (120) is configured to orient the camera device (110) toward the left such that the female person (i.e., object B (142b) as the supporting character) holding/wearing the light source B (143b) appears in the center of the IR image. Accordingly, using the object-identifying code B (402b), the orientation of the camera device (110) is adjusted based on the identified location “a2” of the light source B (143b) such that the object B (142b) appears in the center of the IR image X (126x). By attaching different light sources having distinct object-identifying codes to multiple objects in the scene, object tracking may be switched expediently between different objects in the scene. For example, video recording may continue without disruption while switching the tracked object from the male person to the female person as described above.

To improve accuracy of object tracking, in addition to detecting the location of the reflective light source (143) based on the alternating dark and bright spot in the IR image A (126a), IR image B (126b), IR image C (126c), etc. depicted in FIG. 5 above, the geometric shape of the alternating dark and bright spot is qualified based on matching the geometric shape of the reflective material included in a finger ring or part of a wrist band worn by a male person (i.e., object A (142a)). In other words, any alternating dark and bright spot in the IR image A (126a), IR image B (126b), IR image C (126c), etc. that does not match the geometric shape of the reflective pattern is excluded in identifying the reflective light source (143).

Embodiments of the invention may be implemented on a computing system. Any combination of mobile, desktop, server, router, switch, embedded device, or other types of hardware may be used. For example, as shown in FIG. 7.1, the computing system (700) may include one or more computer processors (702), non-persistent storage (704) (e.g., volatile memory, such as random access memory (RAM), cache memory), persistent storage (706) (e.g., a hard disk, an optical drive such as a compact disk (CD) drive or digital versatile disk (DVD) drive, a flash memory, etc.), a communication interface (712) (e.g., Bluetooth interface, infrared interface, network interface, optical interface, etc.), and numerous other elements and functionalities.

The computer processor(s) (702) may be an integrated circuit for processing instructions. For example, the computer processor(s) may be one or more cores or micro-cores of a processor. The computing system (700) may also include one or more input devices (710), such as a touchscreen, keyboard, mouse, microphone, touchpad, electronic pen, or any other type of input device.

The communication interface (712) may include an integrated circuit for connecting the computing system (700) to a network (not shown) (e.g., a local area network (LAN), a wide area network (WAN) such as the Internet, a mobile network, or any other type of network) and/or to another device, such as another computing device.

Further, the computing system (700) may include one or more output devices (708), such as a screen (e.g., a liquid crystal display (LCD), a plasma display, touchscreen, cathode ray tube (CRT) monitor, projector, or other display device), a printer, external storage, or any other output device. One or more of the output devices may be the same or different from the input device(s). The input and output device(s) may be locally or remotely connected to the computer processor(s) (702), non-persistent storage (704), and persistent storage (706). Many different types of computing systems exist, and the aforementioned input and output device(s) may take other forms.

Software instructions in the form of computer readable program code to perform embodiments of the invention may be stored, in whole or in part, temporarily or permanently, on a non-transitory computer readable medium such as a CD, DVD, storage device, a diskette, a tape, flash memory, physical memory, or any other computer readable storage medium. Specifically, the software instructions may correspond to computer readable program code that, when executed by a processor(s), is configured to perform one or more embodiments of the invention.

The computing system (700) in FIG. 7.1 may be connected to or be a part of a network. For example, as shown in FIG. 7.2, the network (720) may include multiple nodes (e.g., node X (722), node Y (724)). Each node may correspond to a computing system, such as the computing system shown in FIG. 7.1, or a group of nodes combined may correspond to the computing system shown in FIG. 7.1. By way of an example, embodiments of the invention may be implemented on a node of a distributed system that is connected to other nodes. By way of another example, embodiments of the invention may be implemented on a distributed computing system having multiple nodes, where each portion of the invention may be located on a different node within the distributed computing system. Further, one or more elements of the aforementioned computing system (700) may be located at a remote location and connected to the other elements over a network.

Although not shown in FIG. 7.2, the node may correspond to a blade in a server chassis that is connected to other nodes via a backplane. By way of another example, the node may correspond to a server in a data center. By way of another example, the node may correspond to a computer processor or micro-core of a computer processor with shared memory and/or resources.

The nodes (e.g., node X (722), node Y (724)) in the network (720) may be configured to provide services for a client device (726). For example, the nodes may be part of a cloud computing system. The nodes may include functionality to receive requests from the client device (726) and transmit responses to the client device (726). The client device (726) may be a computing system, such as the computing system shown in FIG. 7.1. Further, the client device (726) may include and/or perform all or a portion of one or more embodiments of the invention.

The computing system or group of computing systems described in FIGS. 7.1 and 7.2 may include functionality to perform a variety of operations disclosed herein. For example, the computing system(s) may perform communication between processes on the same or different system. A variety of mechanisms, employing some form of active or passive communication, may facilitate the exchange of data between processes on the same device. Examples representative of these inter-process communications include, but are not limited to, the implementation of a file, a signal, a socket, a message queue, a pipeline, a semaphore, shared memory, message passing, and a memory-mapped file.

The computing system in FIG. 7.1 may implement and/or be connected to a data repository. For example, one type of data repository is a database. A database is a collection of information configured for ease of data retrieval, modification, re-organization, and deletion. A Database Management System (DBMS) is a software application that provides an interface for users to define, create, query, update, or administer databases.

The user, or software application, may submit a statement or query to the DBMS. Then the DBMS interprets the statement. The statement may be a select statement to request information, an update statement, a create statement, a delete statement, etc. Moreover, the statement may include parameters that specify data, or data container (database, table, record, column, view, etc.), identifier(s), conditions (comparison operators), functions (e.g., join, full join, count, average, etc.), sort (e.g., ascending, descending), or others. The DBMS may execute the statement. For example, the DBMS may access a memory buffer, or a reference or index of a file, for read, write, deletion, or any combination thereof, for responding to the statement. The DBMS may load the data from persistent or non-persistent storage and perform computations to respond to the query. The DBMS may return the result(s) to the user or software application.

The above description of functions presents only a few examples of functions performed by the computing system of FIG. 7.1 and the nodes and/or client device in FIG. 7.2. Other functions may be performed using one or more embodiments of the invention.

While the invention has been described with respect to a limited number of embodiments, those skilled in the art, having benefit of this disclosure, will appreciate that other embodiments can be devised which do not depart from the scope of the invention as disclosed herein. Accordingly, the scope of the invention should be limited only by the attached claims.

CLAIMS

1. A method for image capture, comprising: generating, using a hardware processor and by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene; generating, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene; and transmitting a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.
2. The method of claim 1, further comprising: capturing, using the light sensing device and based on the physical configuration of the image capture, a measure of lighting of the object; and determining, based on the measure of lighting of the object, that the pre-determined image capture criterion is satisfied prior to the image of the object being captured, wherein the pre-determined image capture criterion comprises one of front lighting, side lighting, and back lighting.
3. The method of claim 1, further comprising: determining, based on the pre-determined image capture criterion, a target distance between the camera device and the object, wherein the physical configuration of the image capture is generated based at least on the target distance, and wherein the pre-determined image capture criterion comprises one of close-up, half-portrait, full-portrait, and wide-angle.
4. The method of claim 3, further comprising: calculating a region-of-interest based on a dynamic model of the object; and calculating a subject ratio of the object appearing in the image captured by the camera device; wherein the target distance is determined based at least on the subject ratio.
5. The method of claim 1, wherein generating the direction of the visible light source comprises: capturing, using the camera device disposed at a first location of the one or more locations, a first plurality of images each being a portion of a photo sphere centered at the first location; and detecting, by the hardware processor based on the first plurality of images, the direction of the visible light source from the first location, wherein the light sensing device is a visible light sensor of the camera device.
6. The method of claim 1, wherein the command specifies at least one selected from a group consisting of disposing the camera device at the target camera location and disposing the object at the target object location.
7. The method of claim 1, further comprising: capturing, using at least an infrared (IR) sensor of the camera device, a second plurality of images of the scene; detecting, by the hardware processor based on a pattern of local light change across the second plurality of images, an IR light source attached to the object in the scene; and determining, in response to detecting the IR light source, a location of the object in the scene, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture comprises: generating, based on the location of the object in the scene, control information for changing at least one selected from a group consisting of a field-of-view of the camera device and a crop field of the image such that the object appears in the image to substantially align with a target position within at least one selected from the group consisting of the field-of-view of the camera device and the crop field of the image.
8. The method of claim 7, further comprising: based on the control information for changing the crop field of the image, cropping the image.
9. The method of claim 1, further comprising: generating a location control signal to direct a moving platform to move to the target camera location; and generating an orientation control signal to orient the field-of-view of the camera device toward the object from the target camera location, wherein the camera device is disposed on the moving platform.
10. The method of claim 9, wherein the moving platform is a robotic platform holding the camera device and driven by the location control signal to move to the target camera location.
11. The method of claim 9, wherein the moving platform is a human user holding the camera device, wherein based on the location control signal, the camera device outputs an instruction directing the human user to move to the target camera location.
12. The method of claim 1, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture further comprises: generating an instruction directing the object to move to the target object location.
13. The method of claim 7, wherein the IR light source is at least one selected from a group consisting of a local light emitter attached to the object and a reflective region of the object emitting an object-reflected light in response to a remote light emitter separate from the object, wherein the pattern of local light change across the second plurality of images is produced by at least one selected from a group consisting of the local light emitter, the remote light emitter, and a geometric reflection pattern of the reflective region, and wherein the pattern of local light change comprises at least one selected from a group consisting of a light intensity change, a light wavelength change, a repetition rate of the light intensity change, and a repetition rate of the light wavelength change.
14. The method of claim 1, wherein the target camera location is one of a plurality of target camera locations, wherein the image of the object is one of a plurality of images that are captured from the plurality of target camera locations to form a portion of a movie recording, and wherein each of the plurality of images is assigned a corresponding pre-determined image capture criterion specified by a movie script.
15. The method of claim 14, further comprising: analyzing the movie script to determine a plurality of pre-determined image capture criteria; generating, based at least on the plurality of pre-determined image capture criteria, the plurality of target camera locations in the scene; and transmitting a command to the camera device disposed at the plurality of target camera locations in the scene to capture the portion of the movie recording.
16. The method of claim 15, wherein the object corresponds to a first body of a main character and a second body of a supporting character of the movie recording, wherein the IR light source comprises: a first IR light source attached to the first body and configured to produce an object-identifying code identifying the main character, and a second IR light source attached to the second body and configured to produce the object-identifying code identifying the supporting character, wherein generating the plurality of target camera locations in the scene is further based on the object-identifying code, and wherein each of the plurality of pre-determined image capture criteria comprises: the target position and a target size of a visible portion of at least one body selected from a group consisting of the first body and the second body in the image of the object; and one of front lighting, side lighting, and back lighting of the at least one body.
17. An image capture controller, comprising: a computer processor; and a memory coupled to the computer processor, the memory storing instructions, when executed, causing the computer processor to: generate, by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene; generate, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene; and transmit a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.
18. The image capture controller of claim 17, the instructions, when executed, further causing the computer processor to: capture, using the light sensing device and based on the physical configuration of the image capture, a measure of lighting of the object; and determine, based on the measure of lighting of the object, that the pre-determined image capture criterion is satisfied prior to the image of the object being captured, wherein the pre-determined image capture criterion comprises one of front lighting, side lighting, and back lighting.
19. The image capture controller of claim 17, the instructions, when executed, further causing the computer processor to: determine, based on the pre-determined image capture criterion, a target distance between the camera device and the object, wherein the physical configuration of the image capture is generated based at least on the target distance, and wherein the pre-determined image capture criterion comprises one of close-up, half-portrait, full-portrait, and wide-angle.
20. The image capture controller of claim 19, the instructions, when executed, further causing the computer processor to: calculate a region-of-interest based on a dynamic model of the object; and calculate a subject ratio of the object appearing in the image captured by the camera device; wherein the target distance is determined based at least on the subject ratio.
21. The image capture controller of claim 17, wherein generating the direction of the visible light source comprises: capturing, using the camera device disposed at a first location of the one or more locations, a first plurality of images each being a portion of a photo sphere centered at the first location; and detecting, by the computer processor based on the first plurality of images, the direction of the visible light source from the first location, wherein the light sensing device is a visible light sensor of the camera device.
22. The image capture controller of claim 17, wherein the command specifies at least one selected from a group consisting of disposing the camera device at the target camera location and disposing the object at the target object location.
23. The image capture controller of claim 17, the instructions, when executed, further causing the computer processor to: capture, using at least an infrared (IR) sensor of the camera device, a second plurality of images of the scene; detect, based on a pattern of local light change across the second plurality of images, an IR light source attached to the object in the scene; and determine, in response to detecting the IR light source, a location of the object in the scene, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture comprises: generating, based on the location of the object in the scene, control information for changing at least one selected from a group consisting of a field-of-view of the camera device and a crop field of the image such that the object appears in the image to substantially align with a target position within at least one selected from the group consisting of the field-of-view of the camera device and the crop field of the image.
24. The image capture controller of claim 17, the instructions, when executed, further causing the computer processor to: generate a location control signal to direct a moving platform to move to the target camera location; and generate an orientation control signal to orient the field-of-view of the camera device toward the object from the target camera location, wherein the camera device is disposed on the moving platform.
25. The image capture controller of claim 24, wherein the moving platform is a robotic platform holding the camera device and driven by the location control signal to move to the target camera location.
26. The image capture controller of claim 24, wherein the moving platform is a human user holding the camera device, wherein based on the location control signal, the camera device outputs an instruction directing the human user to move to the target camera location.
27. The image capture controller of claim 17, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture further comprises: generating an instruction directing the object to move to the target object location.
28. The image capture controller of claim 23, wherein the IR light source is at least one selected from a group consisting of a local light emitter attached to the object and a reflective region of the object emitting an object-reflected light in response to a remote light emitter separate from the object, wherein the pattern of local light change across the second plurality of images is produced by at least one selected from a group consisting of the local light emitter, the remote light emitter, and a geometric reflection pattern of the reflective region, and wherein the pattern of local light change comprises at least one selected from a group consisting of a light intensity change, a light wavelength change, a repetition rate of the light intensity change, and a repetition rate of the light wavelength change.
29. The image capture controller of claim 17, wherein the target camera location is one of a plurality of target camera locations, wherein the image of the object is one of a plurality of images that are captured from the plurality of target camera locations to form a portion of a movie recording, and wherein each of the plurality of images is assigned a corresponding pre-determined image capture criterion specified by a movie script.
30. The image capture controller of claim 29, further configured to: analyze the movie script to determine a plurality of pre-determined image capture criteria; generate, based at least on the plurality of pre-determined image capture criteria, the plurality of target camera locations in the scene; and transmit a command to the camera device disposed at the plurality of target camera locations in the scene to capture the portion of the movie recording.
31. The image capture controller of claim 30, wherein the object corresponds to a first body of a main character and a second body of a supporting character of the movie recording, wherein the IR light source comprises: a first IR light source attached to the first body and configured to produce an object-identifying code identifying the main character, and a second IR light source attached to the second body and configured to produce the object-identifying code identifying the supporting character, wherein generating the plurality of target camera locations in the scene is further based on the object-identifying code, and wherein each of the plurality of pre-determined image capture criteria comprises: the target position and a target size of a visible portion of at least one body selected from a group consisting of the first body and the second body in the image of the object; and one of front lighting, side lighting, and back lighting of the at least one body.
32. A system for image capture, comprising: a light sensing device; a camera device; and an image capture controller configured to: generate, by disposing the light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene; generate, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene; and transmit a command to the camera device to capture an image of an object in the scene based on the physical configuration of the image capture.
33. The system of claim 32, the image capture controller further configured to: capture, using the light sensing device and based on the physical configuration of the image capture, a measure of lighting of the object; and determine, based on the measure of lighting of the object, that the pre-determined image capture criterion is satisfied prior to the image of the object being captured, wherein the pre-determined image capture criterion comprises one of front lighting, side lighting, and back lighting.
34. The system of claim 32, the image capture controller further configured to: determine, based on the pre-determined image capture criterion, a target distance between the camera device and the object, wherein the physical configuration of the image capture is generated based at least on the target distance, and wherein the pre-determined image capture criterion comprises one of close-up, half-portrait, full-portrait, and wide-angle.
35. The system of claim 34, the image capture controller further configured to: calculate a region-of-interest based on a dynamic model of the object; and calculate a subject ratio of the object appearing in the image captured by the camera device; wherein the target distance is determined based at least on the subject ratio.
36. The system of claim 32, wherein generating the direction of the visible light source comprises: capturing, using the camera device disposed at a first location of the one or more locations, a first plurality of images each being a portion of a photo sphere centered at the first location; and detecting, by the image capture controller based on the first plurality of images, the direction of the visible light source from the first location, wherein the light sensing device is a visible light sensor of the camera device.
37. The system of claim 32, wherein the command specifies at least one selected from a group consisting of disposing the camera device at the target camera location and disposing the object at the target object location.
38. The system of claim 32, the image capture controller further configured to: capture, using at least an infrared (IR) sensor of the camera device, a second plurality of images of the scene; detect, based on a pattern of local light change across the second plurality of images, an IR light source attached to the object in the scene; and determine, in response to detecting the IR light source, a location of the object in the scene, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture comprises: generating, based on the location of the object in the scene, control information for changing at least one selected from a group consisting of a field-of-view of the camera device and a crop field of the image such that the object appears in the image to substantially align with a target position within at least one selected from the group consisting of the field-of-view of the camera device and the crop field of the image.
 39. The system of claim 32, the image capture controller further configured to: generate a location control signal to direct a moving platform to move to the target camera location; and generate an orientation control signal to orient the field-of-view of the camera device toward the object from the target camera location, wherein the camera device is disposed on the moving platform.
 40. The system of claim 39, wherein the moving platform is a robotic platform holding the camera device and driven by the location control signal to move to the target camera location.
 41. The system of claim 39, wherein the moving platform is a human user holding the camera device, wherein based on the location control signal, the camera device outputs an instruction directing the human user to move to the target camera location.
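Claims 39 through 41 route the same location control signal either to a robotic platform or to a human operator. A sketch of that dispatch, with a flat-scene, yaw-only orientation model and a message format that are purely illustrative:

    import math
    from dataclasses import dataclass

    @dataclass
    class LocationSignal:
        x: float  # target camera location in scene coordinates (meters)
        y: float

    def orientation_toward(camera_xy, object_xy):
        # Pan angle (degrees) pointing the field-of-view from the camera
        # location toward the object; assumes a flat scene, yaw only.
        dx, dy = object_xy[0] - camera_xy[0], object_xy[1] - camera_xy[1]
        return math.degrees(math.atan2(dx, dy))

    def dispatch(signal, object_xy, platform_is_robotic):
        pan = orientation_toward((signal.x, signal.y), object_xy)
        if platform_is_robotic:
            # Robotic platform: drive command plus orientation command.
            return {"drive_to": (signal.x, signal.y), "pan_deg": pan}
        # Human operator: the camera device displays a directive instead.
        return ("Move to ({:.1f} m, {:.1f} m), then face {:.0f} degrees"
                .format(signal.x, signal.y, pan))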
 42. The system of claim 32, wherein transmitting the command to the camera device to capture the image of the object in the scene based on the physical configuration of the image capture further comprises: generating an instruction directing the object to move to the target object location.
 43. The system of claim 38, wherein the IR light source is at least one selected from a group consisting of a local light emitter attached to the object and a reflective region of the object emitting an object-reflected light in response to a remote light emitter separate from the object, wherein the pattern of local light change across the second plurality of images is produced by at least one selected from a group consisting of the local light emitter, the remote light emitter, and a geometric reflection pattern of the reflective region, and wherein the pattern of local light change comprises at least one selected from a group consisting of a light intensity change, a light wavelength change, a repetition rate of the light intensity change, and a repetition rate of the light wavelength change.
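The repetition rate in claim 43's pattern of local light change can serve as a signature that tells emitters apart, which claim 46 later exploits as an object-identifying code. A sketch that matches a per-frame intensity series to a known blink rate via its dominant FFT frequency; the rate table, tolerance, and sampling assumptions are illustrative.

    import numpy as np

    # Illustrative assignment of blink rates (Hz) to characters.
    CHARACTER_RATES = {"main": 5.0, "supporting": 8.0}

    def identify_by_repetition_rate(intensity_series, frame_rate_hz,
                                    tolerance_hz=0.5):
        series = np.asarray(intensity_series, dtype=float)
        series -= series.mean()  # remove the DC component
        spectrum = np.abs(np.fft.rfft(series))
        freqs = np.fft.rfftfreq(len(series), d=1.0 / frame_rate_hz)
        dominant = freqs[np.argmax(spectrum[1:]) + 1]  # skip the zero bin
        for name, rate in CHARACTER_RATES.items():
            if abs(dominant - rate) <= tolerance_hz:
                return name
        return None  # no known emitter blinks at the observed rate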
 44. The system of claim 32, wherein the target camera location is one of a plurality of target camera locations, wherein the image of the object is one of a plurality of images that are captured from the plurality of target camera locations to form a portion of a movie recording, and wherein each of the plurality of images is assigned a corresponding pre-determined image capture criterion specified by a movie script.

 45. The system of claim 44, the image capture controller further configured to: analyze the movie script to determine a plurality of pre-determined image capture criteria; generate, based at least on the plurality of pre-determined image capture criteria, the plurality of target camera locations in the scene; and transmit a command to the camera device disposed at the plurality of target camera locations in the scene to capture the portion of the movie recording.
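Claims 44 and 45 derive one capture criterion per shot from the movie script. A toy sketch of that analysis step, assuming shots are annotated with conventional abbreviations; the abbreviation vocabulary and its mapping to the framing criteria of claim 34 are assumptions.

    import re

    # Illustrative mapping from common script shot abbreviations to the
    # framing criteria named in claim 34.
    SHOT_CRITERIA = {"CU": "close-up", "MS": "half-portrait",
                     "FS": "full-portrait", "WS": "wide-angle"}

    def criteria_from_script(script_text):
        # Extract one pre-determined image capture criterion per annotated
        # shot line, in script order.
        criteria = []
        for line in script_text.splitlines():
            match = re.match(r"\s*(CU|MS|FS|WS)\b", line)
            if match:
                criteria.append(SHOT_CRITERIA[match.group(1)])
        return criteria

For example, criteria_from_script("CU - ALICE\nWS - the street") yields ["close-up", "wide-angle"], one criterion per shot in script order.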
 46. The system of claim 45, wherein the object corresponds to a first body of a main character and a second body of a supporting character of the movie recording, wherein the IR light source comprises: a first IR light source attached to the first body and configured to produce an object-identifying code identifying the main character, and a second IR light source attached to the second body and configured to produce the object-identifying code identifying the supporting character, wherein generating the plurality of target camera locations in the scene is further based on the object-identifying code, and wherein each of the plurality of pre-determined image capture criteria comprises: the target position and a target size of a visible portion of at least one body selected from a group consisting of the first body and the second body in the image of the object; and one of front lighting, side lighting, and back lighting of the at least one body.
 47. A non-transitory computer readable medium storing instructions for image capture, the instructions, when executed by a computer processor, comprising functionality for: generating, by disposing a light sensing device at one or more locations in a scene, a direction of a visible light source from each of the one or more locations in the scene; generating, based at least on the direction of the visible light source and a pre-determined image capture criterion, a physical configuration of the image capture, wherein the physical configuration comprises at least one selected from a group consisting of a target camera location and a target object location in the scene; and transmitting a command to a camera device to capture an image of an object in the scene based on the physical configuration of the image capture.